Feature spline regularization #1222

Doresic · 2023-11-29T14:31:06Z

The main focus of the PR: add spline regularization into the spline approximation method.
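
To illustrate what a linear regularization of a spline can look like, the sketch below penalizes how far the spline values at the knots deviate from their least-squares straight line. This is an assumption made for illustration only: the function name `linear_regularization`, the parameter `reg_factor`, and the exact form of the penalty are hypothetical and are not taken from this PR.

```python
import numpy as np


def linear_regularization(knots, spline_values, reg_factor=1.0):
    """Penalty on the deviation of spline values from their best-fit line.

    Hypothetical illustration; not the pyPESTO implementation.
    """
    knots = np.asarray(knots, dtype=float)
    spline_values = np.asarray(spline_values, dtype=float)
    # Least-squares straight line through the (knot, value) pairs.
    slope, intercept = np.polyfit(knots, spline_values, deg=1)
    residuals = spline_values - (slope * knots + intercept)
    # Sum of squared residuals, scaled by the regularization strength.
    return reg_factor * float(np.sum(residuals**2))
```

With a penalty of this kind, a spline that is already a straight line is essentially not penalized, while a strongly curved spline is, and `reg_factor` scales how strongly linearity is favored.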
Other changes:

- A theoretical result shows that the spline parameters do not contribute to the outer gradient calculation, so all of the code that computed that contribution has been deleted 😅
- Small ensemble change: parameter_ids_ensemble was not actually a list, so the .index call on it was failing.
- Fixed the calculate_quantitative_result function of the inner_calculator_collector. It was giving all nan values because the summation in the gradient calculation was not a np.nansum.
- Fixed the extraction of sensitivities of optimization parameters in the gradient calculations of the inner_calculator_collector and of spline_approximation.solver. Previously, par_sim_idx was used as the index into rdata. However, for some conditions not all parameters are used (e.g. observable parameters). In that case, edatas has a plist of the indices of the parameters used for that condition, which has to be used to index the rdata object.
- In the importer's create_objective function, if there are non-quantitative data types, max_sensi_order should be set to 1, as higher-order derivatives are not implemented.
- Small style change in parameters.py related to hierarchical parameter plotting.
- Added spline regularization to the spline plotting function.
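
The max_sensi_order rule from the list above can be sketched as a small helper. The name `cap_sensi_order` and its arguments are hypothetical and do not exist in pyPESTO; the sketch also assumes that a requested order below 1 should be left untouched, which the PR description does not state.

```python
def cap_sensi_order(requested_order: int, has_non_quantitative_data: bool) -> int:
    """Return the highest sensitivity order that can actually be used.

    Hypothetical helper for illustration; not part of pyPESTO.
    """
    # Only first-order derivatives are implemented for non-quantitative
    # data, so anything above 1 is capped at 1.
    if has_non_quantitative_data:
        return min(requested_order, 1)
    return requested_order
```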

Note: Tests of the regularization are still TODO.

Initial commit with spline regularization. The implementation of the gradient w.r.t. theta is not yet complete (swapping to develop for some debugging of censored). Additionally, small fixes to the censored gradient (sigma related, usually not important).

Implemented the complete gradient, including the regularization term for ds_dtheta.

Fixed the parameter plot issue (will be in a separate PR). Added the linear regularization to the spline visualization.

Changes that I might have to revert for the pull request. The ensemble one might have already been pushed by Polina. The solver one completely circumvents the dsdtheta gradient calculation because of an "array cannot be inf or NaN" error that pops up sometimes; the term is almost always 0, so removing it makes things easier to deal with for now. This is a big TODO, do not forget.

Fixed the par_sim_idx in the spline solver's calculate gradient function. For all conditions, the rdata is not indexed by par_sim_idx but by par_edata_indices. This is done so that if some parameters are not used in a condition (observable or noise parameters, for instance), the sensitivities w.r.t. them don't have to be calculated.
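
The indexing issue described above can be sketched as follows. The function `sensitivity_for_parameter`, the array layout, and the toy numbers are assumptions for illustration and are not copied from the solver code: the point is only that the position of a parameter inside the condition's plist, not its global simulation index, addresses the parameter axis of the sensitivities.

```python
import numpy as np


def sensitivity_for_parameter(sy, plist, par_sim_idx):
    """Pick the sensitivity slice of one simulation parameter.

    sy: array of shape (n_timepoints, len(plist), n_observables)
    plist: indices of the simulation parameters used in this condition
    par_sim_idx: index of the parameter among all simulation parameters
    """
    # The position of the parameter inside plist, not par_sim_idx itself,
    # addresses the parameter axis of sy.
    par_edata_idx = list(plist).index(par_sim_idx)
    return sy[:, par_edata_idx, :]


# Three simulation parameters exist, but this condition only uses 0 and 2.
plist = [0, 2]
sy = np.arange(2 * 2 * 1, dtype=float).reshape(2, 2, 1)
slice_for_par_2 = sensitivity_for_parameter(sy, plist, par_sim_idx=2)
```

In this toy setup, indexing `sy` directly with `par_sim_idx=2` would raise an `IndexError`, because the parameter axis only has two entries.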

Added the passing of the edata indices from the calculator to the solver gradient calculation. Additionally, fixed the inner calculator collector quantitative calculation by using nansum instead of a regular sum (sometimes it gave only nan values because of this).
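
A minimal sketch of why a regular sum yields nan while nansum does not. The array values are made up, and the assumption that nan entries mark data points without a measurement is mine, not stated in the PR.

```python
import numpy as np

# Hypothetical per-data-point gradient contributions; nan is assumed
# here to mark a data point without a measurement.
contributions = np.array([0.5, np.nan, -0.2, 1.0])

# A single nan propagates through a regular sum.
plain_total = np.sum(contributions)

# np.nansum treats nan entries as zero and sums the rest.
nan_safe_total = np.nansum(contributions)
```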

Since I've proven that the ds_dtheta gradient contribution is always 0, we can remove all of the code that was calculating it.

codecov-commenter · 2023-11-29T14:33:09Z

Codecov Report

Attention: 389 lines in your changes are missing coverage. Please review.

Comparison is base (160c2a8) 88.16% compared to head (03e6ef0) 83.81%.
Report is 469 commits behind head on develop.

| Files | Patch % | Lines |
|---|---|---|
| pypesto/ensemble/ensemble.py | 69.77% | 68 Missing ⚠️ |
| ...ypesto/hierarchical/spline_approximation/solver.py | 82.62% | 53 Missing ⚠️ |
| pypesto/ensemble/util.py | 65.13% | 38 Missing ⚠️ |
| pypesto/hierarchical/optimal_scaling/solver.py | 93.53% | 26 Missing ⚠️ |
| pypesto/history/base.py | 88.57% | 24 Missing ⚠️ |
| pypesto/hierarchical/petab.py | 84.56% | 23 Missing ⚠️ |
| pypesto/engine/mpi_pool.py | 0.00% | 22 Missing ⚠️ |
| pypesto/hierarchical/inner_calculator_collector.py | 88.51% | 17 Missing ⚠️ |
| ...pesto/hierarchical/spline_approximation/problem.py | 89.18% | 16 Missing ⚠️ |
| pypesto/hierarchical/problem.py | 89.23% | 14 Missing ⚠️ |
... and 16 more

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1222      +/-   ##
===========================================
- Coverage    88.16%   83.81%   -4.36%     
===========================================
  Files           79      148      +69     
  Lines         5257    11948    +6691     
===========================================
+ Hits          4635    10014    +5379     
- Misses         622     1934    +1312

- Updated the notebook for non-linear semi-quantitative data
- Fixed the max_sensi_order problem
- Changed default min_diff_factor
- Fix quality

Doresic · 2023-11-30T13:11:10Z

Tests done 👍

dweindl

I'm not sufficiently familiar with the method, therefore just some general comments.

Please add return type annotations where missing.
https://pypesto--1222.org.readthedocs.build/en/1222/example/example_nonlinear_monotone.html
- The visualize_optimized_model_fit figure is hardly readable
- Not sure whether there is a pypesto policy on notebooks, but I'd prefer clearing outputs before merging.

pypesto/ensemble/ensemble.py

pypesto/hierarchical/inner_calculator_collector.py

pypesto/hierarchical/spline_approximation/solver.py

pypesto/visualize/parameters.py

pypesto/visualize/spline_approximation.py

test/hierarchical/test_spline.py

pypesto/visualize/parameters.py

pypesto/visualize/spline_approximation.py

stephanmg · 2023-12-01T11:24:05Z

General suggestion: There seem to be some files (in the hierarchical folder) with below 80% test coverage; perhaps you can improve this with meaningful tests. If not, that is also fine.

FFroehlich

Please add CODEOWNERS entry for doc/example/example_nonlinear_monotone.ipynb

Only looked at pypesto/petab/importer.py.

pypesto/petab/importer.py

Added tests for untested lines in calculators

PaulJonasJost

There are merge conflicts that need to be resolved. Otherwise approving since there are already extensive reviews. Thanks 👍🏼

m-philipps

Would you consider adding a test for the pypesto/visualize/spline_approximation.py? Can be in a later PR to merge this soon.

Doresic · 2023-12-07T09:49:07Z

> Would you consider adding a test for the pypesto/visualize/spline_approximation.py? Can be in a later PR to merge this soon.

Yes, that would be good to have. I'll add tests in another PR.

Doresic added 13 commits July 13, 2023 13:33

Initial, missing regularization grad

96dc80a

Merge branch 'develop' into feature_spline_regularization

67b09bc

Merge branch 'develop' into feature_spline_regularization

5a6e435

Complete gradient

abf0fcc

Fix par plot, add reg to spline plot

962875d

Remove ds_dtheta_term calculation

947c457

Merge branch 'develop' into feature_spline_regularization

c3661da

Remove redundancy

375a2ab

Small cleanup

f737b07

Remove a mistake

831d8b2

Doresic added 4 commits November 29, 2023 17:00

Add regularization test

f53720d

Quality test fix

e0a5772

Spline tests fix + obj fun fix

9b3afa3

Notebook update

7dd6188

Doresic marked this pull request as ready for review November 30, 2023 14:02

Doresic requested review from dweindl, FFroehlich, dilpath, PaulJonasJost and a team as code owners November 30, 2023 14:02

dweindl requested a review from stephanmg November 30, 2023 14:50

dweindl approved these changes Nov 30, 2023

Daniel review changes

99d0ab7

dilpath approved these changes Nov 30, 2023

pypesto/visualize/parameters.py (resolved)

pypesto/visualize/spline_approximation.py (resolved, 3 threads)

Merge branch 'develop' into feature_spline_regularization

b927d59

stephanmg approved these changes Dec 1, 2023

FFroehlich approved these changes Dec 4, 2023

pypesto/petab/importer.py (resolved)

Dilan&Fabian review changes

0cc1ad5

Doresic requested a review from m-philipps as a code owner December 4, 2023 12:29

Improve test coverage

092de9b

PaulJonasJost approved these changes Dec 6, 2023

Merge branch 'develop' into feature_spline_regularization

7edc504

m-philipps approved these changes Dec 6, 2023

Merge branch 'develop' into feature_spline_regularization

d394f8f

Merge branch 'develop' into feature_spline_regularization

03e6ef0

Doresic merged commit 530a044 into develop Dec 7, 2023
18 checks passed

Doresic deleted the feature_spline_regularization branch December 7, 2023 10:32

This was referenced Jan 30, 2024

Release v0.4.2 #1298

Merged

Prepare release v0.4.2 #1299

Merged
